Spatial-Channel Attention Multi-sensor Fusion Based on Bird's-Eye View
JI Yuzhe1, CHEN Yijie2, YANG Liuqing1,2, ZHENG Xinhu2
1. Internet of Things Thrust, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou 511455;
2. Intelligent Transportation Thrust, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou 511455
Object perception based on the bird's-eye view (BEV) is a popular research topic, but studies on multi-sensor fusion for BEV remain insufficient. Therefore, a multi-sensor fusion module based on spatial-channel attention is proposed. Spatial errors between sensors are effectively corrected by applying local attention mechanisms to the features of different modalities. Through transposed attention operations, image and point cloud data are fully integrated, resolving the semantic heterogeneity between modalities. Consequently, the fused BEV features achieve more comprehensive and accurate perception by effectively combining the unique information of each sensor without introducing spatial misalignment, as sketched below. Experiments on the nuScenes dataset and extensive ablation studies show that the proposed fusion module effectively improves object detection accuracy. Visualization results demonstrate that the fused features capture more complete and accurate information, especially for distant object detection.
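The abstract does not specify the module in code, but the two mechanisms it names can be sketched concretely. The following is a minimal PyTorch sketch, not the authors' implementation: local windowed attention lets each LiDAR BEV location attend over a small neighborhood of the camera BEV feature to absorb residual calibration error, and a transposed (channel-wise) attention in the spirit of cross-covariance attention (XCiT) then mixes the semantics of the concatenated features. The class name, window size, and channel widths are illustrative assumptions; both inputs are assumed to be BEV maps of identical spatial size.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialChannelFusion(nn.Module):
    """Illustrative sketch of spatial-channel attention fusion.

    Hypothetical design, not the paper's released code: local spatial
    attention re-aligns the camera BEV feature to the LiDAR BEV grid,
    then transposed (channel) attention fuses the two modalities.
    """

    def __init__(self, c_img, c_pts, c_out, window=7):
        super().__init__()
        self.window = window
        # Project both modalities to a common channel width.
        self.proj_img = nn.Conv2d(c_img, c_out, 1)
        self.proj_pts = nn.Conv2d(c_pts, c_out, 1)
        # Local spatial attention: LiDAR queries, camera keys/values.
        self.q = nn.Conv2d(c_out, c_out, 1)
        self.k = nn.Conv2d(c_out, c_out, 1)
        self.v = nn.Conv2d(c_out, c_out, 1)
        # Transposed attention over the concatenated features.
        self.norm = nn.LayerNorm(2 * c_out)
        self.qkv = nn.Linear(2 * c_out, 6 * c_out)
        self.out = nn.Conv2d(2 * c_out, c_out, 1)

    def forward(self, f_img, f_pts):
        # f_img: (B, c_img, H, W) camera BEV; f_pts: (B, c_pts, H, W) LiDAR BEV.
        fi, fp = self.proj_img(f_img), self.proj_pts(f_pts)
        B, C, H, W = fi.shape
        w = self.window
        # Unfold camera keys/values into w*w local neighborhoods, so each
        # LiDAR location can search nearby for its misaligned counterpart.
        k = F.unfold(self.k(fi), w, padding=w // 2).view(B, C, w * w, H * W)
        v = F.unfold(self.v(fi), w, padding=w // 2).view(B, C, w * w, H * W)
        q = self.q(fp).view(B, C, 1, H * W)
        attn = (q * k).sum(1, keepdim=True) / C ** 0.5   # (B, 1, w*w, H*W)
        attn = attn.softmax(dim=2)                       # over the neighborhood
        aligned = (attn * v).sum(2).view(B, C, H, W)     # camera feature on LiDAR grid
        x = torch.cat([aligned, fp], dim=1)              # (B, 2C, H, W)
        # Transposed attention: a (2C x 2C) channel-to-channel map instead of
        # token-to-token attention. XCiT adds a learnable temperature, omitted here.
        t = self.norm(x.flatten(2).transpose(1, 2))      # (B, HW, 2C)
        q2, k2, v2 = self.qkv(t).chunk(3, dim=-1)        # each (B, HW, 2C)
        q2 = F.normalize(q2, dim=1)                      # L2-normalize along tokens
        k2 = F.normalize(k2, dim=1)
        ca = (q2.transpose(1, 2) @ k2).softmax(-1)       # (B, 2C, 2C)
        t = (ca @ v2.transpose(1, 2)).view(B, 2 * C, H, W)
        return self.out(x + t)                           # fused BEV feature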
```

As a usage illustration under the same assumptions, `SpatialChannelFusion(80, 256, 128)` applied to camera and LiDAR BEV maps of shapes (4, 80, 180, 180) and (4, 256, 180, 180) returns a fused (4, 128, 180, 180) map; the channel widths and the 7x7 window are arbitrary placeholders, and the window size bounds how much cross-sensor misalignment the spatial attention can correct.